Yu Sun

Sherman

DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter

Feb 05, 2026
Via arXiv

ERNIE 5.0 Technical Report

Feb 04, 2026
Via arXiv

Super-Resolution and Denoising of Corneal B-Scan OCT Imaging Using Diffusion Model Plug-and-Play Priors

Feb 02, 2026
Via arXiv

End-to-end reconstruction of OCT optical properties and speckle-reduced structural intensity via physics-based learning

Feb 02, 2026
Via arXiv

Accurate Network Traffic Matrix Prediction via LEAD: an LLM-Enhanced Adapter-Based Conditional Diffusion Model

Jan 29, 2026
Via arXiv

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

Jan 29, 2026
Via arXiv

Open-Vocabulary Functional 3D Human-Scene Interaction Generation

Jan 28, 2026
Via arXiv

CORD: Bridging the Audio-Text Reasoning Gap via Weighted On-policy Cross-modal Distillation

Jan 23, 2026
Via arXiv

Learning to Discover at Test Time

Jan 22, 2026
Via arXiv

VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction

Jan 09, 2026
Via arXiv